(i) APPLICANT: Israel, David

Wolfman, Neil M.

(ii) TITLE OF INVENTION: Recombinant Bone Morphogenetic Protein 
Heterodimers, Compositions and Methods of Use. 

(iii) NUMBER OF SEQUENCES: 30 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Legal Affairs, Genetics Institute, Inc. 

(B) STREET: 87 CambridgePark Drive 

(C) CITY: Cambridge 

(D) STATE: MA 

(E) COUNTRY: USA 

(F) ZIP: 02140-2387 



(v) COMPUTER READABLE FORM:

(A) MEDIUM TYPE: Tape 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: PatentIn Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: US 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION:

(A) NAME: Kapinos, Ellen J. 

(B) REGISTRATION NUMBER: 32,245

(C) REFERENCE/DOCKET NUMBER: GI-5192B 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: 617-876-1170 

(B) TELEFAX: 617-876-5851 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1607 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 356..1543



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1:



(A) LENGTH: 396 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

Met Val Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gln Val
1 5 10 15

Leu Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys 
20 25 30 

Phe Ala Ala Ala Ser Ser Gly Arg Pro Ser Ser Gln Pro Ser Asp Glu
35 40 45 

Val Leu Ser Glu Phe Glu Leu Arg Leu Leu Ser Met Phe Gly Leu Lys 
50 55 60 

Gln Arg Pro Thr Pro Ser Arg Asp Ala Val Val Pro Pro Tyr Met Leu
65 70 75 80

Asp Leu Tyr Arg Arg His Ser Gly Gln Pro Gly Ser Pro Ala Pro Asp
85 90 95

His Arg Leu Glu Arg Ala Ala Ser Arg Ala Asn Thr Val Arg Ser Phe 
100 105 110 

His His Glu Glu Ser Leu Glu Glu Leu Pro Glu Thr Ser Gly Lys Thr 
115 120 125 

Thr Arg Arg Phe Phe Phe Asn Leu Ser Ser Ile Pro Thr Glu Glu Phe
130 135 140

Ile Thr Ser Ala Glu Leu Gln Val Phe Arg Glu Gln Met Gln Asp Ala
145 150 155 160

Leu Gly Asn Asn Ser Ser Phe His His Arg Ile Asn Ile Tyr Glu Ile
165 170 175

Ile Lys Pro Ala Thr Ala Asn Ser Lys Phe Pro Val Thr Arg Leu Leu
180 185 190

Asp Thr Arg Leu Val Asn Gln Asn Ala Ser Arg Trp Glu Thr Phe Asp
195 200 205

Val Thr Pro Ala Val Met Arg Trp Thr Ala Gln Gly His Ala Asn His
210 215 220

Gly Phe Val Val Glu Val Ala His Leu Glu Glu Lys Gln Gly Val Ser
225 230 235 240

Lys Arg His Val Arg Ile Ser Arg Ser Leu His Gln Asp Glu His Ser
245 250 255

Trp Ser Gln Ile Arg Pro Leu Leu Val Thr Phe Gly His Asp Gly Lys
260 265 270

Gly His Pro Leu His Lys Arg Glu Lys Arg Gln Ala Lys His Lys Gln



GTCGACTCTA GAGTGTGTGT CAGCACTTGG CTGGGGACTT CTTGAACTTG CAGGGAGAAT 60

AACTTGCGCA CCCCACTTTG CGCCGGTGCC TTTGCCCCAG CGGAGCCTGC TTCGCCATCT 120

CCGAGCCCCA CCGCCCCTCC ACTCCTCGGC CTTGCCCGAC ACTGAGACGC TGTTCCCAGC 180

GTGAAAAGAG AGACTGCGCG GCCGGCACCC GGGAGAAGGA GGAGGCAAAG AAAAGGAACG 240

GACATTCGGT CCTTGCGCCA GGTCCTTTGA CCAGAGTTTT TCCATGTGGA CGCTCTTTCA 300

ATGGACGTGT CCCCGCGTGC TTCTTAGACG GACTGCGGTC TCCTAAAGGT CGACC ATG 358
Met
1

GTG GCC GGG ACC CGC TGT CTT CTA GCG TTG CTG CTT CCC CAG GTC CTC 406
Val Ala Gly Thr Arg Cys Leu Leu Ala Leu Leu Leu Pro Gln Val Leu
5 10 15 

CTG GGC GGC GCG GCT GGC CTC GTT CCG GAG CTG GGC CGC AGG AAG TTC 454 
Leu Gly Gly Ala Ala Gly Leu Val Pro Glu Leu Gly Arg Arg Lys Phe 
20 25 30 

GCG GCG GCG TCG TCG GGC CGC CCC TCA TCC CAG CCC TCT GAC GAG GTC 502 
Ala Ala Ala Ser Ser Gly Arg Pro Ser Ser Gln Pro Ser Asp Glu Val
35 40 45

CTG AGC GAG TTC GAG TTG CGG CTG CTC AGC ATG TTC GGC CTG AAA CAG 550
Leu Ser Glu Phe Glu Leu Arg Leu Leu Ser Met Phe Gly Leu Lys Gln
50 55 60 65

AGA CCC ACC CCC AGC AGG GAC GCC GTG GTG CCC CCC TAC ATG CTA GAC 598
Arg Pro Thr Pro Ser Arg Asp Ala Val Val Pro Pro Tyr Met Leu Asp
70 75 80

CTG TAT CGC AGG CAC TCA GGT CAG CCG GGC TCA CCC GCC CCA GAC CAC 646
Leu Tyr Arg Arg His Ser Gly Gln Pro Gly Ser Pro Ala Pro Asp His
85 90 95

CGG TTG GAG AGG GCA GCC AGC CGA GCC AAC ACT GTG CGC AGC TTC CAC 694 
Arg Leu Glu Arg Ala Ala Ser Arg Ala Asn Thr Val Arg Ser Phe His 
100 105 110 

CAT GAA GAA TCT TTG GAA GAA CTA CCA GAA ACG AGT GGG AAA ACA ACC 742 
His Glu Glu Ser Leu Glu Glu Leu Pro Glu Thr Ser Gly Lys Thr Thr 
115 120 125 

CGG AGA TTC TTC TTT AAT TTA AGT TCT ATC CCC ACG GAG GAG TTT ATC 790 
Arg Arg Phe Phe Phe Asn Leu Ser Ser Ile Pro Thr Glu Glu Phe Ile
130 135 140 145

ACC TCA GCA GAG CTT CAG GTT TTC CGA GAA CAG ATG CAA GAT GCT TTA 838
Thr Ser Ala Glu Leu Gln Val Phe Arg Glu Gln Met Gln Asp Ala Leu
150 155 160

GGA AAC AAT AGC AGT TTC CAT CAC CGA ATT AAT ATT TAT GAA ATC ATA 886
Gly Asn Asn Ser Ser Phe His His Arg Ile Asn Ile Tyr Glu Ile Ile
165 170 175

AAA CCT GCA ACA GCC AAC TCG AAA TTC CCC GTG ACC AGA CTT TTG GAC 934
Lys Pro Ala Thr Ala Asn Ser Lys Phe Pro Val Thr Arg Leu Leu Asp
180 185 190

ACC AGG TTG GTG AAT CAG AAT GCA AGC AGG TGG GAA ACT TTT GAT GTC 982
Thr Arg Leu Val Asn Gln Asn Ala Ser Arg Trp Glu Thr Phe Asp Val
195 200 205

ACC CCC GCT GTG ATG CGG TGG ACT GCA CAG GGA CAC GCC AAC CAT GGA 1030
Thr Pro Ala Val Met Arg Trp Thr Ala Gln Gly His Ala Asn His Gly
210 215 220 225

TTC GTG GTG GAA GTG GCC CAC TTG GAG GAG AAA CAA GGT GTC TCC AAG 1078
Phe Val Val Glu Val Ala His Leu Glu Glu Lys Gln Gly Val Ser Lys
230 235 240

AGA CAT GTT AGG ATA AGC AGG TCT TTG CAC CAA GAT GAA CAC AGC TGG 1126
Arg His Val Arg Ile Ser Arg Ser Leu His Gln Asp Glu His Ser Trp
245 250 255

TCA CAG ATA AGG CCA TTG CTA GTA ACT TTT GGC CAT GAT GGA AAA GGG 1174
Ser Gln Ile Arg Pro Leu Leu Val Thr Phe Gly His Asp Gly Lys Gly
260 265 270

CAT CCT CTC CAC AAA AGA GAA AAA CGT CAA GCC AAA CAC AAA CAG CGG 1222
His Pro Leu His Lys Arg Glu Lys Arg Gln Ala Lys His Lys Gln Arg
275 280 285

AAA CGC CTT AAG TCC AGC TGT AAG AGA CAC CCT TTG TAC GTG GAC TTC 1270
Lys Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp Phe
290 295 300 305

AGT GAC GTG GGG TGG AAT GAC TGG ATT GTG GCT CCC CCG GGG TAT CAC 1318
Ser Asp Val Gly Trp Asn Asp Trp Ile Val Ala Pro Pro Gly Tyr His
310 315 320

GCC TTT TAC TGC CAC GGA GAA TGC CCT TTT CCT CTG GCT GAT CAT CTG 1366
Ala Phe Tyr Cys His Gly Glu Cys Pro Phe Pro Leu Ala Asp His Leu
325 330 335

AAC TCC ACT AAT CAT GCC ATT GTT CAG ACG TTG GTC AAC TCT GTT AAC 1414
Asn Ser Thr Asn His Ala Ile Val Gln Thr Leu Val Asn Ser Val Asn
340 345 350

TCT AAG ATT CCT AAG GCA TGC TGT GTC CCG ACA GAA CTC AGT GCT ATC 1462
Ser Lys Ile Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala Ile
355 360 365

TCG ATG CTG TAC CTT GAC GAG AAT GAA AAG GTT GTA TTA AAG AAC TAT 1510
Ser Met Leu Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys Asn Tyr
370 375 380 385

CAG GAC ATG GTT GTG GAG GGT TGT GGG TGT CGC TAGTACAGCA AAATTAAATA 1563
Gln Asp Met Val Val Glu Gly Cys Gly Cys Arg
390 395

CATAAATATA TATATATATA TATATTTTAG AAAAAAGAAA AAAA 1607 
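The paired codon and residue rows above can be cross-checked mechanically. The following is a minimal sketch, assuming the standard genetic code; the names `CODON_TABLE`, `ONE_TO_THREE` and `translate_row` are illustrative helpers and are not part of the listing.

```python
from itertools import product

BASES = "TCAG"
# Standard genetic code, one letter per codon in TCAG order ("*" marks a stop).
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): a for c, a in zip(product(BASES, repeat=3), AMINO)}

# Three-letter residue codes as used in the listing; "End" is a hypothetical
# label for a stop codon, chosen here only for illustration.
ONE_TO_THREE = {
    "A": "Ala", "R": "Arg", "N": "Asn", "D": "Asp", "C": "Cys",
    "Q": "Gln", "E": "Glu", "G": "Gly", "H": "His", "I": "Ile",
    "L": "Leu", "K": "Lys", "M": "Met", "F": "Phe", "P": "Pro",
    "S": "Ser", "T": "Thr", "W": "Trp", "Y": "Tyr", "V": "Val",
    "*": "End",
}


def translate_row(codons: str) -> str:
    """Translate a space-separated row of DNA codons to three-letter residues."""
    return " ".join(ONE_TO_THREE[CODON_TABLE[c]] for c in codons.split())


# Row ending at nucleotide 454 of SEQ ID NO: 1.
row = "CTG GGC GGC GCG GCT GGC CTC GTT CCG GAG CTG GGC CGC AGG AAG TTC"
print(translate_row(row))
```

A row whose translation disagrees with the printed residues is a candidate transcription error in either the codon line or the residue line.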



(2) INFORMATION FOR SEQ ID NO: 2: 

(i) SEQUENCE CHARACTERISTICS: 



275 280 285 

Arg Lys Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp 
290 295 300 

Phe Ser Asp Val Gly Trp Asn Asp Trp Ile Val Ala Pro Pro Gly Tyr
305 310 315 320

His Ala Phe Tyr Cys His Gly Glu Cys Pro Phe Pro Leu Ala Asp His
325 330 335

Leu Asn Ser Thr Asn His Ala Ile Val Gln Thr Leu Val Asn Ser Val
340 345 350

Asn Ser Lys Ile Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala
355 360 365

Ile Ser Met Leu Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys Asn
370 375 380

Tyr Gln Asp Met Val Val Glu Gly Cys Gly Cys Arg
385 390 395 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1954 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS

(B) LOCATION: 403..1626



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

CTCTAGAGGG CAGAGGAGGA GGGAGGGAGG GAAGGAGCGC GGAGCCCGGC CCGGAAGCTA 

GGTGAGTGTG GCATCCGAGC TGAGGGACGC GAGCCTGAGA CGCCGCTGCT GCTCCGGCTG 

AGTATCTAGC TTGTCTCCCC GATGGGATTC CCGTCCAAGC TATCTCGAGC CTGCAGCGCC 

ACAGTCCCCG GCCCTCGCCC AGGTTCACTG CAACCGTTCA GAGGTCCCCA GGAGCTGCTG 

CTGGCGAGCC CGCTACTGCA GGGACCTATG GAGCCATTCC GTAGTGCCAT CCCGAGCAAC 

GCACTGCTGC AGCTTCCCTG AGCCTTTCCA GCAAGTTTGT TCAAGATTGG CTGTCAAGAA 

TCATGGACTG TTATTATATG CCTTGTTTTC TGTCAAGACA CC ATG ATT CCT GGT 

Met Ile Pro Gly
1

AAC CGA ATG CTG ATG GTC GTT TTA TTA TGC CAA GTC CTG CTA GGA GGC
Asn Arg Met Leu Met Val Val Leu Leu Cys Gln Val Leu Leu Gly Gly
5 10 15 20



GCG AGC CAT GCT AGT TTG ATA CCT GAG ACG GGG AAG AAA AAA GTC GCC 510
Ala Ser His Ala Ser Leu Ile Pro Glu Thr Gly Lys Lys Lys Val Ala
25 30 35

GAG ATT CAG GGC CAC GCG GGA GGA CGC CGC TCA GGG CAG AGC CAT GAG 558 
Glu Ile Gln Gly His Ala Gly Gly Arg Arg Ser Gly Gln Ser His Glu
40 45 50

CTC CTG CGG GAC TTC GAG GCG ACA CTT CTG CAG ATG TTT GGG CTG CGC 606
Leu Leu Arg Asp Phe Glu Ala Thr Leu Leu Gln Met Phe Gly Leu Arg
55 60 65

CGC CGC CCG CAG CCT AGC AAG AGT GCC GTC ATT CCG GAC TAC ATG CGG 654
Arg Arg Pro Gln Pro Ser Lys Ser Ala Val Ile Pro Asp Tyr Met Arg
70 75 80

GAT CTT TAC CGG CTT CAG TCT GGG GAG GAG GAG GAA GAG CAG ATC CAC 702
Asp Leu Tyr Arg Leu Gln Ser Gly Glu Glu Glu Glu Glu Gln Ile His
85 90 95 100

AGC ACT GGT CTT GAG TAT CCT GAG CGC CCG GCC AGC CGG GCC AAC ACC 750
Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala Ser Arg Ala Asn Thr
105 110 115

GTG AGG AGC TTC CAC CAC GAA GAA CAT CTG GAG AAC ATC CCA GGG ACC 798
Val Arg Ser Phe His His Glu Glu His Leu Glu Asn Ile Pro Gly Thr
120 125 130

AGT GAA AAC TCT GCT TTT CGT TTC CTC TTT AAC CTC AGC AGC ATC CCT 846
Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu Ser Ser Ile Pro
135 140 145

GAG AAC GAG GTG ATC TCC TCT GCA GAG CTT CGG CTC TTC CGG GAG CAG 894
Glu Asn Glu Val Ile Ser Ser Ala Glu Leu Arg Leu Phe Arg Glu Gln
150 155 160

GTG GAC CAG GGC CCT GAT TGG GAA AGG GGC TTC CAC CGT ATA AAC ATT 942
Val Asp Gln Gly Pro Asp Trp Glu Arg Gly Phe His Arg Ile Asn Ile
165 170 175 180

TAT GAG GTT ATG AAG CCC CCA GCA GAA GTG GTG CCT GGG CAC CTC ATC 990
Tyr Glu Val Met Lys Pro Pro Ala Glu Val Val Pro Gly His Leu Ile
185 190 195

ACA CGA CTA CTG GAC ACG AGA CTG GTC CAC CAC AAT GTG ACA CGG TGG 1038
Thr Arg Leu Leu Asp Thr Arg Leu Val His His Asn Val Thr Arg Trp
200 205 210

GAA ACT TTT GAT GTG AGC CCT GCG GTC CTT CGC TGG ACC CGG GAG AAG 1086
Glu Thr Phe Asp Val Ser Pro Ala Val Leu Arg Trp Thr Arg Glu Lys
215 220 225

CAG CCA AAC TAT GGG CTA GCC ATT GAG GTG ACT CAC CTC CAT CAG ACT 1134
Gln Pro Asn Tyr Gly Leu Ala Ile Glu Val Thr His Leu His Gln Thr
230 235 240

CGG ACC CAC CAG GGC CAG CAT GTC AGG ATT AGC CGA TCG TTA CCT CAA 1182
Arg Thr His Gln Gly Gln His Val Arg Ile Ser Arg Ser Leu Pro Gln
245 250 255 260

GGG AGT GGG AAT TGG GCC CAG CTC CGG CCC CTC CTG GTC ACC TTT GGC 1230
Gly Ser Gly Asn Trp Ala Gln Leu Arg Pro Leu Leu Val Thr Phe Gly
265 270 275

CAT GAT GGC CGG GGC CAT GCC TTG ACC CGA CGC CGG AGG GCC AAG CGT 1278
His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg Arg Ala Lys Arg
280 285 290

AGC CCT AAG CAT CAC TCA CAG CGG GCC AGG AAG AAG AAT AAG AAC TGC 1326
Ser Pro Lys His His Ser Gln Arg Ala Arg Lys Lys Asn Lys Asn Cys
295 300 305

CGG CGC CAC TCG CTC TAT GTG GAC TTC AGC GAT GTG GGC TGG AAT GAC 1374
Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp
310 315 320

TGG ATT GTG GCC CCA CCA GGC TAC CAG GCC TTC TAC TGC CAT GGG GAC 1422
Trp Ile Val Ala Pro Pro Gly Tyr Gln Ala Phe Tyr Cys His Gly Asp
325 330 335 340

TGC CCC TTT CCA CTG GCT GAC CAC CTC AAC TCA ACC AAC CAT GCC ATT 1470 
Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala Ile
345 350 355

GTG CAG ACC CTG GTC AAT TCT GTC AAT TCC AGT ATC CCC AAA GCC TGT 1518
Val Gln Thr Leu Val Asn Ser Val Asn Ser Ser Ile Pro Lys Ala Cys
360 365 370

TGT GTG CCC ACT GAA CTG AGT GCC ATC TCC ATG CTG TAC CTG GAT GAG 1566
Cys Val Pro Thr Glu Leu Ser Ala Ile Ser Met Leu Tyr Leu Asp Glu
375 380 385

TAT GAT AAG GTG GTA CTG AAA AAT TAT CAG GAG ATG GTA GTA GAG GGA 1614
Tyr Asp Lys Val Val Leu Lys Asn Tyr Gln Glu Met Val Val Glu Gly
390 395 400

TGT GGG TGC CGC TGAGATCAGG CAGTCCTTGA GGATAGACAG ATATACACAC 1666
Cys Gly Cys Arg
405



CACACACACA CACCACATAC ACCACACACA CACGTTCCCA TCCACTCACC CACACACTAC 1726

ACAGACTGCT TCCTTATAGC TGGACTTTTA TTTAAAAAAA AAAAAAAAAA AATGGAAAAA 1786

ATCCCTAAAC ATTCACCTTG ACCTTATTTA TGACTTTACG TGCAAATGTT TTGACCATAT 1846

TGATCATATA TTTTGACAAA ATATATTTAT AACTACGTAT TAAAAGAAAA AAATAAAATG 1906

AGTCATTATT TTAAAAAAAA AAAAAAAACT CTAGAGTCGA CGGAATTC 1954



(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 408 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 

Met Ile Pro Gly Asn Arg Met Leu Met Val Val Leu Leu Cys Gln Val
1 5 10 15

Leu Leu Gly Gly Ala Ser His Ala Ser Leu Ile Pro Glu Thr Gly Lys
20 25 30

Lys Lys Val Ala Glu Ile Gln Gly His Ala Gly Gly Arg Arg Ser Gly
35 40 45

Gln Ser His Glu Leu Leu Arg Asp Phe Glu Ala Thr Leu Leu Gln Met
50 55 60

Phe Gly Leu Arg Arg Arg Pro Gln Pro Ser Lys Ser Ala Val Ile Pro
65 70 75 80

Asp Tyr Met Arg Asp Leu Tyr Arg Leu Gln Ser Gly Glu Glu Glu Glu
85 90 95

Glu Gln Ile His Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala Ser
100 105 110

Arg Ala Asn Thr Val Arg Ser Phe His His Glu Glu His Leu Glu Asn
115 120 125

Ile Pro Gly Thr Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu
130 135 140

Ser Ser Ile Pro Glu Asn Glu Val Ile Ser Ser Ala Glu Leu Arg Leu
145 150 155 160

Phe Arg Glu Gln Val Asp Gln Gly Pro Asp Trp Glu Arg Gly Phe His
165 170 175

Arg Ile Asn Ile Tyr Glu Val Met Lys Pro Pro Ala Glu Val Val Pro
180 185 190

Gly His Leu Ile Thr Arg Leu Leu Asp Thr Arg Leu Val His His Asn
195 200 205

Val Thr Arg Trp Glu Thr Phe Asp Val Ser Pro Ala Val Leu Arg Trp
210 215 220

Thr Arg Glu Lys Gln Pro Asn Tyr Gly Leu Ala Ile Glu Val Thr His
225 230 235 240

Leu His Gln Thr Arg Thr His Gln Gly Gln His Val Arg Ile Ser Arg
245 250 255

Ser Leu Pro Gln Gly Ser Gly Asn Trp Ala Gln Leu Arg Pro Leu Leu
260 265 270

Val Thr Phe Gly His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg
275 280 285

Arg Ala Lys Arg Ser Pro Lys His His Ser Gln Arg Ala Arg Lys Lys
290 295 300

Asn Lys Asn Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val
305 310 315 320

Gly Trp Asn Asp Trp Ile Val Ala Pro Pro Gly Tyr Gln Ala Phe Tyr
325 330 335

Cys His Gly Asp Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr
340 345 350

Asn His Ala Ile Val Gln Thr Leu Val Asn Ser Val Asn Ser Ser Ile
355 360 365

Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala Ile Ser Met Leu
370 375 380

Tyr Leu Asp Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gln Glu Met
385 390 395 400

Val Val Glu Gly Cys Gly Cys Arg 
405 

(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1448 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: unknown 

(ii) MOLECULE TYPE: DNA (genomic) 



(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 97..1389



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

GTGACCGAGC GGCGCGGACG GCCGCCTGCC CCCTCTGCCA CCTGGGGCGG TGCGGGCCCG 

GAGCCCGGAG CCCGGGTAGC GCGTAGAGCC GGCGCG ATG CAC GTG CGC TCA CTG 

Met His Val Arg Ser Leu 
1 5 

CGA GCT GCG GCG CCG CAC AGC TTC GTG GCG CTC TGG GCA CCC CTG TTC 
Arg Ala Ala Ala Pro His Ser Phe Val Ala Leu Trp Ala Pro Leu Phe 
10 15 20 

CTG CTG CGC TCC GCC CTG GCC GAC TTC AGC CTG GAC AAC GAG GTG CAC 
Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser Leu Asp Asn Glu Val His 
25 30 35 

TCG AGC TTC ATC CAC CGG CGC CTC CGC AGC CAG GAG CGG CGG GAG ATG 
Ser Ser Phe Ile His Arg Arg Leu Arg Ser Gln Glu Arg Arg Glu Met
40 45 50

CAG CGC GAG ATC CTC TCC ATT TTG GGC TTG CCC CAC CGC CCG CGC CCG
Gln Arg Glu Ile Leu Ser Ile Leu Gly Leu Pro His Arg Pro Arg Pro
55 60 65 70

CAC CTC CAG GGC AAG CAC AAC TCG GCA CCC ATG TTC ATG CTG GAC CTG 354
His Leu Gln Gly Lys His Asn Ser Ala Pro Met Phe Met Leu Asp Leu
75 80 85

TAC AAC GCC ATG GCG GTG GAG GAG GGC GGC GGG CCC GGC GGC CAG GGC 402
Tyr Asn Ala Met Ala Val Glu Glu Gly Gly Gly Pro Gly Gly Gln Gly
90 95 100 

TTC TCC TAC CCC TAC AAG GCC GTC TTC AGT ACC CAG GGC CCC CCT CTG 450 
Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser Thr Gln Gly Pro Pro Leu
105 110 115

GCC AGC CTG CAA GAT AGC CAT TTC CTC ACC GAC GCC GAC ATG GTC ATG 498
Ala Ser Leu Gln Asp Ser His Phe Leu Thr Asp Ala Asp Met Val Met
120 125 130

AGC TTC GTC AAC CTC GTG GAA CAT GAC AAG GAA TTC TTC CAC CCA CGC 546
Ser Phe Val Asn Leu Val Glu His Asp Lys Glu Phe Phe His Pro Arg
135 140 145 150

TAC CAC CAT CGA GAG TTC CGG TTT GAT CTT TCC AAG ATC CCA GAA GGG 594
Tyr His His Arg Glu Phe Arg Phe Asp Leu Ser Lys Ile Pro Glu Gly
155 160 165

GAA GCT GTC ACG GCA GCC GAA TTC CGG ATC TAC AAG GAC TAC ATC CGG 642
Glu Ala Val Thr Ala Ala Glu Phe Arg Ile Tyr Lys Asp Tyr Ile Arg
170 175 180

GAA CGC TTC GAC AAT GAG ACG TTC CGG ATC AGC GTT TAT CAG GTG CTC 690
Glu Arg Phe Asp Asn Glu Thr Phe Arg Ile Ser Val Tyr Gln Val Leu
185 190 195

CAG GAG CAC TTG GGC AGG GAA TCG GAT CTC TTC CTG CTC GAC AGC CGT 738
Gln Glu His Leu Gly Arg Glu Ser Asp Leu Phe Leu Leu Asp Ser Arg
200 205 210

ACC CTC TGG GCC TCG GAG GAG GGC TGG CTG GTG TTT GAC ATC ACA GCC 786
Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu Val Phe Asp Ile Thr Ala
215 220 225 230

ACC AGC AAC CAC TGG GTG GTC AAT CCG CGG CAC AAC CTG GGC CTG CAG 834
Thr Ser Asn His Trp Val Val Asn Pro Arg His Asn Leu Gly Leu Gln
235 240 245

CTC TCG GTG GAG ACG CTG GAT GGG CAG AGC ATC AAC CCC AAG TTG GCG 882
Leu Ser Val Glu Thr Leu Asp Gly Gln Ser Ile Asn Pro Lys Leu Ala
250 255 260

GGC CTG ATT GGG CGG CAC GGG CCC CAG AAC AAG CAG CCC TTC ATG GTG 930
Gly Leu Ile Gly Arg His Gly Pro Gln Asn Lys Gln Pro Phe Met Val
265 270 275

GCT TTC TTC AAG GCC ACG GAG GTC CAC TTC CGC AGC ATC CGG TCC ACG 978
Ala Phe Phe Lys Ala Thr Glu Val His Phe Arg Ser Ile Arg Ser Thr
280 285 290

GGG AGC AAA CAG CGC AGC CAG AAC CGC TCC AAG ACG CCC AAG AAC CAG 1026
Gly Ser Lys Gln Arg Ser Gln Asn Arg Ser Lys Thr Pro Lys Asn Gln
295 300 305 310



GAA GCC CTG CGG ATG GCC AAC GTG GCA GAG AAC AGC AGC AGC GAC CAG 1074
Glu Ala Leu Arg Met Ala Asn Val Ala Glu Asn Ser Ser Ser Asp Gln
315 320 325

AGG CAG GCC TGT AAG AAG CAC GAG CTG TAT GTC AGC TTC CGA GAC CTG 1122
Arg Gln Ala Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu
330 335 340

GGC TGG CAG GAC TGG ATC ATC GCG CCT GAA GGC TAC GCC GCC TAC TAC 1170
Gly Trp Gln Asp Trp Ile Ile Ala Pro Glu Gly Tyr Ala Ala Tyr Tyr
345 350 355

TGT GAG GGG GAG TGT GCC TTC CCT CTG AAC TCC TAC ATG AAC GCC ACC 1218 
Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn Ser Tyr Met Asn Ala Thr 
360 365 370 

AAC CAC GCC ATC GTG CAG ACG CTG GTC CAC TTC ATC AAC CCG GAA ACG 1266
Asn His Ala Ile Val Gln Thr Leu Val His Phe Ile Asn Pro Glu Thr
375 380 385 390

GTG CCC AAG CCC TGC TGT GCG CCC ACG CAG CTC AAT GCC ATC TCC GTC 1314
Val Pro Lys Pro Cys Cys Ala Pro Thr Gln Leu Asn Ala Ile Ser Val
395 400 405

CTC TAC TTC GAT GAC AGC TCC AAC GTC ATC CTG AAG AAA TAC AGA AAC 1362
Leu Tyr Phe Asp Asp Ser Ser Asn Val Ile Leu Lys Lys Tyr Arg Asn
410 415 420

ATG GTG GTC CGG GCC TGT GGC TGC CAC TAGCTCCTCC GAGAATTCAG 1409
Met Val Val Arg Ala Cys Gly Cys His
425 430

ACCCTTTGGG GCCAAGTTTT TCTGGATCCT CCATTGCTC 1448 

(2) INFORMATION FOR SEQ ID NO: 6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6: 

Met His Val Arg Ser Leu Arg Ala Ala Ala Pro His Ser Phe Val Ala
1 5 10 15

Leu Trp Ala Pro Leu Phe Leu Leu Arg Ser Ala Leu Ala Asp Phe Ser
20 25 30

Leu Asp Asn Glu Val His Ser Ser Phe Ile His Arg Arg Leu Arg Ser
35 40 45

Gln Glu Arg Arg Glu Met Gln Arg Glu Ile Leu Ser Ile Leu Gly Leu
50 55 60

Pro His Arg Pro Arg Pro His Leu Gln Gly Lys His Asn Ser Ala Pro
65 70 75 80

Met Phe Met Leu Asp Leu Tyr Asn Ala Met Ala Val Glu Glu Gly Gly
85 90 95

Gly Pro Gly Gly Gln Gly Phe Ser Tyr Pro Tyr Lys Ala Val Phe Ser
100 105 110

Thr Gln Gly Pro Pro Leu Ala Ser Leu Gln Asp Ser His Phe Leu Thr
115 120 125

Asp Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu His Asp Lys
130 135 140

Glu Phe Phe His Pro Arg Tyr His His Arg Glu Phe Arg Phe Asp Leu
145 150 155 160

Ser Lys Ile Pro Glu Gly Glu Ala Val Thr Ala Ala Glu Phe Arg Ile
165 170 175

Tyr Lys Asp Tyr Ile Arg Glu Arg Phe Asp Asn Glu Thr Phe Arg Ile
180 185 190

Ser Val Tyr Gln Val Leu Gln Glu His Leu Gly Arg Glu Ser Asp Leu
195 200 205

Phe Leu Leu Asp Ser Arg Thr Leu Trp Ala Ser Glu Glu Gly Trp Leu
210 215 220

Val Phe Asp Ile Thr Ala Thr Ser Asn His Trp Val Val Asn Pro Arg
225 230 235 240

His Asn Leu Gly Leu Gln Leu Ser Val Glu Thr Leu Asp Gly Gln Ser
245 250 255

Ile Asn Pro Lys Leu Ala Gly Leu Ile Gly Arg His Gly Pro Gln Asn
260 265 270

Lys Gln Pro Phe Met Val Ala Phe Phe Lys Ala Thr Glu Val His Phe
275 280 285

Arg Ser Ile Arg Ser Thr Gly Ser Lys Gln Arg Ser Gln Asn Arg Ser
290 295 300

Lys Thr Pro Lys Asn Gln Glu Ala Leu Arg Met Ala Asn Val Ala Glu
305 310 315 320

Asn Ser Ser Ser Asp Gln Arg Gln Ala Cys Lys Lys His Glu Leu Tyr
325 330 335

Val Ser Phe Arg Asp Leu Gly Trp Gln Asp Trp Ile Ile Ala Pro Glu
340 345 350

Gly Tyr Ala Ala Tyr Tyr Cys Glu Gly Glu Cys Ala Phe Pro Leu Asn
355 360 365

Ser Tyr Met Asn Ala Thr Asn His Ala Ile Val Gln Thr Leu Val His
370 375 380

Phe Ile Asn Pro Glu Thr Val Pro Lys Pro Cys Cys Ala Pro Thr Gln
385 390 395 400

Leu Asn Ala Ile Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn Val Ile
405 410 415

Leu Lys Lys Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His 
420 425 430 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2923 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(F) TISSUE TYPE: Human placenta 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Stratagene catalog #936203 Human placenta cDNA library

(B) CLONE: BMP6C35 

(viii) POSITION IN GENOME: 

(C) UNITS: bp 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 160..1701

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1282..1698

(ix) FEATURE: 

(A) NAME/KEY: mRNA 

(B) LOCATION: 1..2923 
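The feature locations above can be checked against the residue numbering with simple arithmetic. The sketch below assumes locations are inclusive `start..end` base positions; `span_length` is a hypothetical helper name introduced only for this illustration.

```python
def span_length(location: str) -> int:
    """Return the number of bases covered by an inclusive 'start..end' location."""
    start, end = (int(part) for part in location.replace(" ", "").split(".."))
    return end - start + 1


cds = span_length("160..1701")      # CDS feature of SEQ ID NO: 7
mature = span_length("1282..1698")  # mat_peptide feature of SEQ ID NO: 7
pro_codons = (1282 - 160) // 3      # codons preceding the mature peptide

print(cds, cds // 3)        # 1542 514
print(mature, mature // 3)  # 417 139
print(pro_codons)           # 374
```

Under these assumptions the 374 codons before the mature peptide agree with the listing's numbering, which starts at -374, and 374 + 139 gives the 513 residues stated for SEQ ID NO: 8. The one extra codon in the CDS span would be the terminal stop codon, assuming the CDS feature includes it.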

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7: 

CGACCATGAG AGATAAGGAC TGAGGGCCAG GAAGGGGAAG CGAGCCCGCC GAGAGGTGGC 

GGGGACTGCT CACGCCAAGG GCCACAGCGG CCGCGCTCCG GCCTCGCTCC GCCGCTCCAC 

GCCTCGCGGG ATCCGCGGGG GCAGCCCGGC CGGGCGGGG ATG CCG GGG CTG GGG 

Met Pro Gly Leu Gly 
-374 -370 

CGG AGG GCG CAG TGG CTG TGC TGG TGG TGG GGG CTG CTG TGC AGC TGC 
Arg Arg Ala Gln Trp Leu Cys Trp Trp Trp Gly Leu Leu Cys Ser Cys
-365 -360 -355

TGC GGG CCC CCG CCG CTG CGG CCG CCC TTG CCC GCT GCC GCG GCC GCC
Cys Gly Pro Pro Pro Leu Arg Pro Pro Leu Pro Ala Ala Ala Ala Ala
-350 -345 -340

GCC GCC GGG GGG CAG CTG CTG GGG GAC GGC GGG AGC CCC GGC CGC ACG 318
Ala Ala Gly Gly Gln Leu Leu Gly Asp Gly Gly Ser Pro Gly Arg Thr
-335 -330 -325

GAG CAG CCG CCG CCG TCG CCG CAG TCC TCC TCG GGC TTC CTG TAC CGG 366
Glu Gln Pro Pro Pro Ser Pro Gln Ser Ser Ser Gly Phe Leu Tyr Arg
-320 -315 -310

CGG CTC AAG ACG CAG GAG AAG CGG GAG ATG CAG AAG GAG ATC TTG TCG 414
Arg Leu Lys Thr Gln Glu Lys Arg Glu Met Gln Lys Glu Ile Leu Ser
-305 -300 -295 -290

GTG CTG GGG CTC CCG CAC CGG CCC CGG CCC CTG CAC GGC CTC CAA CAG 462
Val Leu Gly Leu Pro His Arg Pro Arg Pro Leu His Gly Leu Gln Gln
-285 -280 -275

CCG CAG CCC CCG GCG CTC CGG CAG CAG GAG GAG CAG CAG CAG CAG CAG 510
Pro Gln Pro Pro Ala Leu Arg Gln Gln Glu Glu Gln Gln Gln Gln Gln
-270 -265 -260

CAG CTG CCT CGC GGA GAG CCC CCT CCC GGG CGA CTG AAG TCC GCG CCC 558
Gln Leu Pro Arg Gly Glu Pro Pro Pro Gly Arg Leu Lys Ser Ala Pro
-255 -250 -245

CTC TTC ATG CTG GAT CTG TAC AAC GCC CTG TCC GCC GAC AAC GAC GAG 606
Leu Phe Met Leu Asp Leu Tyr Asn Ala Leu Ser Ala Asp Asn Asp Glu
-240 -235 -230

GAC GGG GCG TCG GAG GGG GAG AGG CAG CAG TCC TGG CCC CAC GAA GCA 654
Asp Gly Ala Ser Glu Gly Glu Arg Gln Gln Ser Trp Pro His Glu Ala
-225 -220 -215 -210

GCC AGC TCG TCC CAG CGT CGG CAG CCG CCC CCG GGC GCC GCG CAC CCG 702
Ala Ser Ser Ser Gln Arg Arg Gln Pro Pro Pro Gly Ala Ala His Pro
-205 -200 -195

CTC AAC CGC AAG AGC CTT CTG GCC CCC GGA TCT GGC AGC GGC GGC GCG 750
Leu Asn Arg Lys Ser Leu Leu Ala Pro Gly Ser Gly Ser Gly Gly Ala
-190 -185 -180

TCC CCA CTG ACC AGC GCG CAG GAC AGC GCC TTC CTC AAC GAC GCG GAC 798
Ser Pro Leu Thr Ser Ala Gln Asp Ser Ala Phe Leu Asn Asp Ala Asp
-175 -170 -165

ATG GTC ATG AGC TTT GTG AAC CTG GTG GAG TAC GAC AAG GAG TTC TCC 846
Met Val Met Ser Phe Val Asn Leu Val Glu Tyr Asp Lys Glu Phe Ser
-160 -155 -150

CCT CGT CAG CGA CAC CAC AAA GAG TTC AAG TTC AAC TTA TCC CAG ATT 894
Pro Arg Gln Arg His His Lys Glu Phe Lys Phe Asn Leu Ser Gln Ile
-145 -140 -135 -130

CCT GAG GGT GAG GTG GTG ACG GCT GCA GAA TTC CGC ATC TAC AAG GAC 942
Pro Glu Gly Glu Val Val Thr Ala Ala Glu Phe Arg Ile Tyr Lys Asp
-125 -120 -115

TGT GTT ATG GGG AGT TTT AAA AAC CAA ACT TTT CTT ATC AGC ATT TAT 990
Cys Val Met Gly Ser Phe Lys Asn Gln Thr Phe Leu Ile Ser Ile Tyr
-110 -105 -100

CAA GTC TTA CAG GAG CAT CAG CAC AGA GAC TCT GAC CTG TTT TTG TTG
Gln Val Leu Gln Glu His Gln His Arg Asp Ser Asp Leu Phe Leu Leu
-95 -90 -85

GAC ACC CGT GTA GTA TGG GCC TCA GAA GAA GGC TGG CTG GAA TTT GAC 
Asp Thr Arg Val Val Trp Ala Ser Glu Glu Gly Trp Leu Glu Phe Asp
-80 -75 -70

ATC ACG GCC ACT AGC AAT CTG TGG GTT GTG ACT CCA CAG CAT AAC ATG
Ile Thr Ala Thr Ser Asn Leu Trp Val Val Thr Pro Gln His Asn Met
-65 -60 -55 -50

GGG CTT CAG CTG AGC GTG GTG ACA AGG GAT GGA GTC CAC GTC CAC CCC
Gly Leu Gln Leu Ser Val Val Thr Arg Asp Gly Val His Val His Pro
-45 -40 -35

CGA GCC GCA GGC CTG GTG GGC AGA GAC GGC CCT TAC GAT AAG CAG CCC
Arg Ala Ala Gly Leu Val Gly Arg Asp Gly Pro Tyr Asp Lys Gln Pro
-30 -25 -20

TTC ATG GTG GCT TTC TTC AAA GTG AGT GAG GTC CAC GTG CGC ACC ACC
Phe Met Val Ala Phe Phe Lys Val Ser Glu Val His Val Arg Thr Thr
-15 -10 -5

AGG TCA GCC TCC AGC CGG CGC CGA CAA CAG AGT CGT AAT CGC TCT ACC
Arg Ser Ala Ser Ser Arg Arg Arg Gln Gln Ser Arg Asn Arg Ser Thr
1 5 10 15

CAG TCC CAG GAC GTG GCG CGG GTC TCC AGT GCT TCA GAT TAC AAC AGC
Gln Ser Gln Asp Val Ala Arg Val Ser Ser Ala Ser Asp Tyr Asn Ser
20 25 30

AGT GAA TTG AAA ACA GCC TGC AGG AAG CAT GAG CTG TAT GTG AGT TTC
Ser Glu Leu Lys Thr Ala Cys Arg Lys His Glu Leu Tyr Val Ser Phe
35 40 45

CAA GAC CTG GGA TGG CAG GAC TGG ATC ATT GCA CCC AAG GGC TAT GCT
Gln Asp Leu Gly Trp Gln Asp Trp Ile Ile Ala Pro Lys Gly Tyr Ala
50 55 60

GCC AAT TAC TGT GAT GGA GAA TGC TCC TTC CCA CTC AAC GCA CAC ATG
Ala Asn Tyr Cys Asp Gly Glu Cys Ser Phe Pro Leu Asn Ala His Met
65 70 75

AAT GCA ACC AAC CAC GCG ATT GTG CAG ACC TTG GTT CAC CTT ATG AAC
Asn Ala Thr Asn His Ala Ile Val Gln Thr Leu Val His Leu Met Asn
80 85 90 95

CCC GAG TAT GTC CCC AAA CCG TGC TGT GCG CCA ACT AAG CTA AAT GCC
Pro Glu Tyr Val Pro Lys Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala
100 105 110

ATC TCG GTT CTT TAC TTT GAT GAC AAC TCC AAT GTC ATT CTG AAA AAA
Ile Ser Val Leu Tyr Phe Asp Asp Asn Ser Asn Val Ile Leu Lys Lys
115 120 125

TAC AGG AAT ATG GTT GTA AGA GCT TGT GGA TGC CAC TAACTCGAAA
Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys His
130 135 140



[UCACjA TGCTG] GGGACACACA TTCTGCCTTG GATTCCTAGA TTACATCTGC CTTAAAAAAA 1768

CAGTTGGAGG TGGGACGATG AGACTTTGAA ACTATCTCAT GCCAGTGCCT 1828

[1 Ai xACUUACs] GAAGATTTTA AAGGACCTCA TTAATAATTT GCTCACTTGG TAAATGACGT 1888

[oA(j 1 AG i i GT] TGGTCTGTAG CAAGCTGAGT TTGGATGTCT GTAGCATAAG GTCTGGTAAC 1948

[1 vjUAGAAACA] TAACCGTGAA GCTCTTCCTA CCCTCCTCCC CCAAAAACCC ACCAAAATTA 2008

[Gil n AGCTG] TAGATCAAGC TATTTGGGGT GTTTGTTAGT AAATAGGGAA AATAATCTCA 2068

AAGGAGTTAA ATGTATTCTT GGCTAAAGGA TCAGCTGGTT CAGTACTGTC TATCAAAGGT 2128

[AGAl 1 1 lACA] GAGAACAGAA ATCGGGGAAG TGGGGGGAAC GCCTCTGTTC AGTTCATTCC 2188

[O A & A ^TiO/^X CAtaAAG 1 C^LoA] CAGGACGCAC AGCCCAGGCC ACAGCCAGGG CTCCACGGGG CGCCCTTGTC 2248

[1 ^ A^ 1 C A 1 1 G] CTGTTGTATG TTCGTGCTGG AGTTTTGTTG GTGTGAAAAT ACACTTATTT 2308

[p Anpr* A A A A p] [A m o ^ IV mmm ^ A 1 ACCATTTC] TACACCTCAA TCCTCCATTT GCTGTACTCT TTGCTAGTAC 2368

[C i GATTACAC] TGAGGTGAGG CTACAAGGGG TGTGTAACCG TGTAACACGT 2428

[V- i CACCTCTT] CTTTACCAGA ACGGTTCTTT GACCAGCACA TTAACTTCTG 2488

[A P T" P P P P ^] [iCTAGTACCT] TTTCAGTAAA GTGGTTCTCT GCCTTTTTAC TATACAGCAT 2548

[APPAPrZPP AP] [AljljQa i i AGAA] CCAACGAAGA AAATAAAATG AGGGTGCCCA GCTTATAAGA 2608

[OjGGGATGAGC] ATGCTGTTTA TGAACGGAAA TCATGATTTC CCTGTAGAAA 2668

GTGAGGCTCA GATTAAATTT TAGAATATTT TCTAAATGTC TTTTTCACAA TCATGTGACT 2728

GGGAAGGCAA TTTCATACTA AACTGATTAA ATAATACATT TATAATCTAC AACTGTTTGC 2788

ACTTACAGCT TTTTTTGTAA ATATAAACTA TAATTTATTG TCTATTTTAT ATCTGTTTTG 2848

CTGTGGCGTT GGGGGGGGGG CCGGGCTTTT GGGGGGGGGG GTTTGTTTGG GGGGTGTCGT 2908

GGTGTGGGCG GGCGG 2923



(2) INFORMATION FOR SEQ ID NO: 8:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 513 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8: 

Met Pro Gly Leu Gly Arg Arg Ala Gln Trp Leu Cys Trp Trp Trp Gly
-374 -370 -365 -360

Leu Leu Cys Ser Cys Cys Gly Pro Pro Pro Leu Arg Pro Pro Leu Pro
-355 -350 -345 

Ala Ala Ala Ala Ala Ala Ala Gly Gly Gln Leu Leu Gly Asp Gly Gly
-340 -335 -330

Ser Pro Gly Arg Thr Glu Gln Pro Pro Pro Ser Pro Gln Ser Ser Ser
-325 -320 -315

Gly Phe Leu Tyr Arg Arg Leu Lys Thr Gln Glu Lys Arg Glu Met Gln
-310 -305 -300 -295

Lys Glu Ile Leu Ser Val Leu Gly Leu Pro His Arg Pro Arg Pro Leu
-290 -285 -280

His Gly Leu Gln Gln Pro Gln Pro Pro Ala Leu Arg Gln Gln Glu Glu
-275 -270 -265

Gln Gln Gln Gln Gln Gln Leu Pro Arg Gly Glu Pro Pro Pro Gly Arg
-260 -255 -250

Leu Lys Ser Ala Pro Leu Phe Met Leu Asp Leu Tyr Asn Ala Leu Ser
-245 -240 -235

Ala Asp Asn Asp Glu Asp Gly Ala Ser Glu Gly Glu Arg Gln Gln Ser
-230 -225 -220 -215

Trp Pro His Glu Ala Ala Ser Ser Ser Gln Arg Arg Gln Pro Pro Pro
-210 -205 -200

Gly Ala Ala His Pro Leu Asn Arg Lys Ser Leu Leu Ala Pro Gly Ser
-195 -190 -185

Gly Ser Gly Gly Ala Ser Pro Leu Thr Ser Ala Gln Asp Ser Ala Phe
-180 -175 -170 

Leu Asn Asp Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu Tyr 
-165 -160 -155 



Asp Lys Glu Phe Ser Pro Arg Gln Arg His His Lys Glu Phe Lys Phe
-150 -145 -140 -135

Asn Leu Ser Gln Ile Pro Glu Gly Glu Val Val Thr Ala Ala Glu Phe
-130 -125 -120

Arg Ile Tyr Lys Asp Cys Val Met Gly Ser Phe Lys Asn Gln Thr Phe
-115 -110 -105

Leu Ile Ser Ile Tyr Gln Val Leu Gln Glu His Gln His Arg Asp Ser
-100 -95 -90

Asp Leu Phe Leu Leu Asp Thr Arg Val Val Trp Ala Ser Glu Glu Gly
-85 -80 -75

Trp Leu Glu Phe Asp Ile Thr Ala Thr Ser Asn Leu Trp Val Val Thr
-70 -65 -60 -55

Pro Gln His Asn Met Gly Leu Gln Leu Ser Val Val Thr Arg Asp Gly
-50 -45 -40



Val His Val His Pro Arg Ala Ala Gly Leu Val Gly Arg Asp Gly Pro 
-35 -30 -25 

Tyr Asp Lys Gln Pro Phe Met Val Ala Phe Phe Lys Val Ser Glu Val
-20 -15 -10

His Val Arg Thr Thr Arg Ser Ala Ser Ser Arg Arg Arg Gln Gln Ser
-5 1 5 10

Arg Asn Arg Ser Thr Gln Ser Gln Asp Val Ala Arg Val Ser Ser Ala
15 20 25

Ser Asp Tyr Asn Ser Ser Glu Leu Lys Thr Ala Cys Arg Lys His Glu
30 35 40

Leu Tyr Val Ser Phe Gln Asp Leu Gly Trp Gln Asp Trp Ile Ile Ala
45 50 55

Pro Lys Gly Tyr Ala Ala Asn Tyr Cys Asp Gly Glu Cys Ser Phe Pro
60 65 70

Leu Asn Ala His Met Asn Ala Thr Asn His Ala Ile Val Gln Thr Leu
75 80 85 90

Val His Leu Met Asn Pro Glu Tyr Val Pro Lys Pro Cys Cys Ala Pro
95 100 105

Thr Lys Leu Asn Ala Ile Ser Val Leu Tyr Phe Asp Asp Asn Ser Asn
110 115 120

Val Ile Leu Lys Lys Tyr Arg Asn Met Val Val Arg Ala Cys Gly Cys
125 130 135 

His 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2153 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(H) CELL LINE: U2-OS osteosarcoma 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: U2-OS human osteosarcoma cDNA library 

(B) CLONE: U2-16 

(viii) POSITION IN GENOME: 

(C) UNITS: bp 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 699..2063 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1647..2060 

(ix) FEATURE: 

(A) NAME/KEY: mRNA 

(B) LOCATION: 1..2153 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

CTGGTATATT TGTGCCTGCT GGAGGTGGAA TTAACAGTAA GAAGGAGAAA GGGATTGAAT 60 

GGACTTACAG GAAGGATTTC AAGTAAATTC AGGGAAACAC ATTTACTTGA ATAGTACAAC 120 

CTAGAGTATT ATTTTACACT AAGACGACAC AAAAGATGTT AAAGTTATCA CCAAGCTGCC 180 

GGACAGATAT ATATTCCAAC ACCAAGGTGC AGATCAGCAT AGATCTGTGA TTCAGAAATC 240 

AGGATTTGTT TTGGAAAGAG CTCAAGGGTT GAGAAGAACT CAAAAGCAAG TGAAGATTAC 300 

TTTGGGAACT ACAGTTTATC AGAAGATCAA CTTTTGCTAA TTCAAATACC AAAGGCCTGA 360 

TTATCATAAA TTCATATAGG AATGCATAGG TCATCTGATC AAATAATATT AGCCGTCTTC 420 

TGCTACATCA ATGCAGCAAA AACTCTTAAC AACTGTGGAT AATTGGAAAT CTGAGTTTCA 480 

GCTTTCTTAG AAATAACTAC TCTTGACATA TTCCAAAATA TTTAAAATAG GACAGGAAAA 540 

TCGGTGAGGA TGTTGTGCTC AGAAATGTCA CTGTCATGAA AAATAGGTAA ATTTGTTTTT 600 

TCAGCTACTG GGAAACTGTA CCTCCTAGAA CCTTAGGTTT TTTTTTTTTT AAGAGGACAA 660 

GAAGGACTAA AAATATCAAC TTTTGCTTTT GGACAAAA ATG CAT CTG ACT GTA 713 
Met His Leu Thr Val 
-316 -315 

TTT TTA CTT AAG GGT ATT GTG GGT TTC CTC TGG AGC TGC TGG GTT CTA 761 
Phe Leu Leu Lys Gly Ile Val Gly Phe Leu Trp Ser Cys Trp Val Leu 
-310 -305 -300 

GTG GGT TAT GCA AAA GGA GGT TTG GGA GAC AAT CAT GTT CAC TCC AGT 809 
Val Gly Tyr Ala Lys Gly Gly Leu Gly Asp Asn His Val His Ser Ser 
-295 -290 -285 -280 

TTT ATT TAT AGA AGA CTA CGG AAC CAC GAA AGA CGG GAA ATA CAA AGG 857 
Phe Ile Tyr Arg Arg Leu Arg Asn His Glu Arg Arg Glu Ile Gln Arg 
-275 -270 -265 

GAA ATT CTC TCT ATC TTG GGT TTG CCT CAC AGA CCC AGA CCA TTT TCA 905 
Glu Ile Leu Ser Ile Leu Gly Leu Pro His Arg Pro Arg Pro Phe Ser 
-260 -255 -250 

CCT GGA AAA ATG ACC AAT CAA GCG TCC TCT GCA CCT CTC TTT ATG CTr 953 
Pro Gly Lys Met Thr Asn Gln Ala Ser Ser Ala Pro Leu Phe Met Leu 
-245 -240 -235 

GAT CTC TAC AAT GCC GAA GAA AAT CCT GAA GAG TCG GAG TAC TCA GTA 1001 
Asp Leu Tyr Asn Ala Glu Glu Asn Pro Glu Glu Ser Glu Tyr Ser Val 
-230 -225 -220 

AGG GCA TCC TTG GCA GAA GAG ACC AGA GGG GCA AGA AAG GGA TAC CCA 1049 
Arg Ala Ser Leu Ala Glu Glu Thr Arg Gly Ala Arg Lys Gly Tyr Pro 
-215 -210 -205 -200 

GCC TCT CCC AAT GGG TAT CCT CGT CGC ATA CAG TTA TCT CGG ACG ACT 1097 
Ala Ser Pro Asn Gly Tyr Pro Arg Arg Ile Gln Leu Ser Arg Thr Thr 
-195 -190 -185 

CCT CTG ACC ACC CAG AGT CCT CCT CTA GCC AGC CTC CAT GAT ACC AAC 1145 
Pro Leu Thr Thr Gln Ser Pro Pro Leu Ala Ser Leu His Asp Thr Asn 
-180 -175 -170 

TTT CTG AAT GAT GCT GAC ATG GTC ATG AGC TTT GTC AAC TTA GTT GAA 1193 
Phe Leu Asn Asp Ala Asp Met Val Met Ser Phe Val Asn Leu Val Glu 
-165 -160 -155 

AGA GAC AAG GAT TTT TCT CAC CAG CGA AGG CAT TAC AAA GAA TTT CGA 1241 
Arg Asp Lys Asp Phe Ser His Gln Arg Arg His Tyr Lys Glu Phe Arg 
-150 -145 -140 

TTT GAT CTT ACC CAA ATT CCT CAT GGA GAG GCA GTG ACA GCA GCT GAA 1289 
Phe Asp Leu Thr Gln Ile Pro His Gly Glu Ala Val Thr Ala Ala Glu 
-135 -130 -125 -120 

TTC CGG ATA TAC AAG GAC CGG AGC AAC AAC CGA TTT GAA AAT GAA ACA 1337 
Phe Arg Ile Tyr Lys Asp Arg Ser Asn Asn Arg Phe Glu Asn Glu Thr 
-115 -110 -105 

ATT AAG ATT AGC ATA TAT CAA ATC ATC AAG GAA TAC ACA AAT AGG GAT 1385 
Ile Lys Ile Ser Ile Tyr Gln Ile Ile Lys Glu Tyr Thr Asn Arg Asp 
-100 -95 -90 

GCA GAT CTG TTC TTG TTA GAC ACA AGA AAG GCC CAA GCT TTA GAT GTG 1433 
Ala Asp Leu Phe Leu Leu Asp Thr Arg Lys Ala Gln Ala Leu Asp Val 
-85 -80 -75 

GGT TGG CTT GTC TTT GAT ATC ACT GTG ACC AGC AAT CAT TGG GTG ATT 1481 
Gly Trp Leu Val Phe Asp Ile Thr Val Thr Ser Asn His Trp Val Ile 
-70 -65 -60 

AAT CCC CAG AAT AAT TTG GGC TTA CAG CTC TGT GCA GAA ACA GGG GAT 1529 
Asn Pro Gln Asn Asn Leu Gly Leu Gln Leu Cys Ala Glu Thr Gly Asp 
-55 -50 -45 -40 

GGA CGC AGT ATC AAC GTA AAA TCT GCT GGT CTT GTG GGA AGA CAG GGA 1577 
Gly Arg Ser Ile Asn Val Lys Ser Ala Gly Leu Val Gly Arg Gln Gly 
-35 -30 -25 

CCT CAG TCA AAA CAA CCA TTC ATG GTG GCC TTC TTC AAG GCG AGT GAG 1625 
Pro Gln Ser Lys Gln Pro Phe Met Val Ala Phe Phe Lys Ala Ser Glu 
-20 -15 -10 

GTA CTT CTT CGA TCC GTG AGA GCA GCC AAC AAA CGA AAA AAT CAA AAC 1673 
Val Leu Leu Arg Ser Val Arg Ala Ala Asn Lys Arg Lys Asn Gln Asn 
-5 1 5 

CGC AAT AAA TCC AGC TCT CAT CAG GAC TCC TCC AGA ATG TCC AGT GTT 1721 
Arg Asn Lys Ser Ser Ser His Gln Asp Ser Ser Arg Met Ser Ser Val 
10 15 20 25 

GGA GAT TAT AAC ACA AGT GAG CAA AAA CAA GCC TGT AAG AAG CAC GAA 1769 
Gly Asp Tyr Asn Thr Ser Glu Gln Lys Gln Ala Cys Lys Lys His Glu 
30 35 40 

CTC TAT GTG AGC TTC CGG GAT CTG GGA TGG CAG GAC TGG ATT ATA GCA 1817 
Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gln Asp Trp Ile Ile Ala 
45 50 55 

CCA GAA GGA TAC GCT GCA TTT TAT TGT GAT GGA GAA TGT TCT TTT CCA 1865 
Pro Glu Gly Tyr Ala Ala Phe Tyr Cys Asp Gly Glu Cys Ser Phe Pro 
60 65 70 

CTT AAC GCC CAT ATG AAT GCC ACC AAC CAC GCT ATA GTT CAG ACT CTG 1913 
Leu Asn Ala His Met Asn Ala Thr Asn His Ala Ile Val Gln Thr Leu 
75 80 85 

GTT CAT CTG ATG TTT CCT GAC CAC GTA CCA AAG CCT TGT TGT GCT CCA 1961 
Val His Leu Met Phe Pro Asp His Val Pro Lys Pro Cys Cys Ala Pro 
90 95 100 105 

ACC AAA TTA AAT GCC ATC TCT GTT CTG TAC TTT GAT GAC AGC TCC AAT 2009 
Thr Lys Leu Asn Ala Ile Ser Val Leu Tyr Phe Asp Asp Ser Ser Asn 
110 115 120 

GTC ATT TTG AAA AAA TAT AGA AAT ATG GTA GTA CGC TCA TGT GGC TGC 2057 
Val Ile Leu Lys Lys Tyr Arg Asn Met Val Val Arg Ser Cys Gly Cys 
125 130 135 

CAC TAATATTAAA TAATATTGAT AATAACAAAA AGATCTGTAT TAAGGTTTAT 2110 
His 

GGCTGCAATA AAAAGCATAC TTTCAGACAA ACAGAAAAAA AAA 2153 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

Met His Leu Thr Val Phe Leu Leu Lys Gly Ile Val Gly Phe Leu Trp 
-316 -315 -310 -305 

Ser Cys Trp Val Leu Val Gly Tyr Ala Lys Gly Gly Leu Gly Asp Asn 
-300 -295 -290 -285 

His Val His Ser Ser Phe Ile Tyr Arg Arg Leu Arg Asn His Glu Arg 
-280 -275 -270 

Arg Glu Ile Gln Arg Glu Ile Leu Ser Ile Leu Gly Leu Pro His Arg 
-265 -260 -255 

Pro Arg Pro Phe Ser Pro Gly Lys Met Thr Asn Gln Ala Ser Ser Ala 
-250 -245 -240 

Pro Leu Phe Met Leu Asp Leu Tyr Asn Ala Glu Glu Asn Pro Glu Glu 
-235 -230 -225 

Ser Glu Tyr Ser Val Arg Ala Ser Leu Ala Glu Glu Thr Arg Gly Ala 
-220 -215 -210 -205 

Arg Lys Gly Tyr Pro Ala Ser Pro Asn Gly Tyr Pro Arg Arg Ile Gln 
-200 -195 -190 

Leu Ser Arg Thr Thr Pro Leu Thr Thr Gln Ser Pro Pro Leu Ala Ser 
-185 -180 -175 

Leu His Asp Thr Asn Phe Leu Asn Asp Ala Asp Met Val Met Ser Phe 
-170 -165 -160 

Val Asn Leu Val Glu Arg Asp Lys Asp Phe Ser His Gln Arg Arg His 
-155 -150 -145 

Tyr Lys Glu Phe Arg Phe Asp Leu Thr Gln Ile Pro His Gly Glu Ala 
-140 -135 -130 -125 

Val Thr Ala Ala Glu Phe Arg Ile Tyr Lys Asp Arg Ser Asn Asn Arg 
-120 -115 -110 

Phe Glu Asn Glu Thr Ile Lys Ile Ser Ile Tyr Gln Ile Ile Lys Glu 
-105 -100 -95 

Tyr Thr Asn Arg Asp Ala Asp Leu Phe Leu Leu Asp Thr Arg Lys Ala 
-90 -85 -80 

Gln Ala Leu Asp Val Gly Trp Leu Val Phe Asp Ile Thr Val Thr Ser 
-75 -70 -65 

Asn His Trp Val Ile Asn Pro Gln Asn Asn Leu Gly Leu Gln Leu Cys 
-60 -55 -50 -45 

Ala Glu Thr Gly Asp Gly Arg Ser Ile Asn Val Lys Ser Ala Gly Leu 
-40 -35 -30 

Val Gly Arg Gln Gly Pro Gln Ser Lys Gln Pro Phe Met Val Ala Phe 
-25 -20 -15 

Phe Lys Ala Ser Glu Val Leu Leu Arg Ser Val Arg Ala Ala Asn Lys 
-10 -5 1 

Arg Lys Asn Gln Asn Arg Asn Lys Ser Ser Ser His Gln Asp Ser Ser 
5 10 15 20 

Arg Met Ser Ser Val Gly Asp Tyr Asn Thr Ser Glu Gln Lys Gln Ala 
25 30 35 

Cys Lys Lys His Glu Leu Tyr Val Ser Phe Arg Asp Leu Gly Trp Gln 
40 45 50 

Asp Trp Ile Ile Ala Pro Glu Gly Tyr Ala Ala Phe Tyr Cys Asp Gly 
55 60 65 

Glu Cys Ser Phe Pro Leu Asn Ala His Met Asn Ala Thr Asn His Ala 
70 75 80 

Ile Val Gln Thr Leu Val His Leu Met Phe Pro Asp His Val Pro Lys 
85 90 95 100 

Pro Cys Cys Ala Pro Thr Lys Leu Asn Ala Ile Ser Val Leu Tyr Phe 
105 110 115 

Asp Asp Ser Ser Asn Val Ile Leu Lys Lys Tyr Arg Asn Met Val Val 
120 125 130 

Arg Ser Cys Gly Cys His 
135 



(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1003 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: circular 

(ii) MOLECULE TYPE: cDNA to mRNA 

(iii) HYPOTHETICAL: NO 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 
(F) TISSUE TYPE: Human Heart 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Human heart cDNA library Stratagene catalog #936208 

(B) CLONE: hH38 

(viii) POSITION IN GENOME: 
(C) UNITS: bp 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 8..850 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 427..843 

(ix) FEATURE: 

(A) NAME/KEY: mRNA 

(B) LOCATION: 1..997 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11: 



GAATTCC GAG CCC CAT TGG AAG GAG TTC CGC TTT GAC CTG ACC CAG ATC 
Glu Pro His Trp Lys Glu Phe Arg Phe Asp Leu Thr Gln Ile 
-139 -135 -130 

CCG GCT GGG GAG GCG GTC ACA GCT GCG GAG TTC CGG ATT TAC AAG GTG 
Pro Ala Gly Glu Ala Val Thr Ala Ala Glu Phe Arg Ile Tyr Lys Val 
-125 -120 -115 -110 

CCC AGC ATC CAC CTG CTC AAC AGG ACC CTC CAC GTC AGC ATG TTC CAG 
Pro Ser Ile His Leu Leu Asn Arg Thr Leu His Val Ser Met Phe Gln 
-105 -100 -95 

GTG GTC CAG GAG CAG TCC AAC AGG GAG TCT GAC TTG TTC TTT TTG GAT 
Val Val Gln Glu Gln Ser Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp 
-90 -85 -80 

CTT CAG ACG CTC CGA GCT GGA GAC GAG GGC TGG CTG GTG CTG GAT GTC 
Leu Gln Thr Leu Arg Ala Gly Asp Glu Gly Trp Leu Val Leu Asp Val 
-75 -70 -65 

ACA GCA GCC AGT GAC TGC TGG TTG CTG AAG CGT CAC AAG GAC CTG GGA 
Thr Ala Ala Ser Asp Cys Trp Leu Leu Lys Arg His Lys Asp Leu Gly 
-60 -55 -50 

CTC CGC CTC TAT GTG GAG ACT GAG GAT GGG CAC AGC GTG GAT CCT GGC 
Leu Arg Leu Tyr Val Glu Thr Glu Asp Gly His Ser Val Asp Pro Gly 
-45 -40 -35 -30 

CTG GCC GGC CTG CTG GGT CAA CGG GCC CCA CGC TCC CAA CAG CCT TTC 
Leu Ala Gly Leu Leu Gly Gln Arg Ala Pro Arg Ser Gln Gln Pro Phe 
-25 -20 -15 

GTG GTC ACT TTC TTC AGG GCC AGT CCG AGT CCC ATC CGC ACC CCT CGG 
Val Val Thr Phe Phe Arg Ala Ser Pro Ser Pro Ile Arg Thr Pro Arg 
-10 -5 1 

GCA GTG AGG CCA CTG AGG AGG AGG CAG CCG AAG AAA AGC AAC GAG CTG 
Ala Val Arg Pro Leu Arg Arg Arg Gln Pro Lys Lys Ser Asn Glu Leu 
5 10 15 

CCG CAG GCC AAC CGA CTC CCA GGG ATC TTT GAT GAC GTC CAC GGC TCC 
Pro Gln Ala Asn Arg Leu Pro Gly Ile Phe Asp Asp Val His Gly Ser 
20 25 30 35 

CAC GGC CGG CAG GTC TGC CGT CGG CAC GAG CTC TAC GTC AGC TTC CAG 
His Gly Arg Gln Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Gln 
40 45 50 

GAC CTT GGC TGG CTG GAC TGG GTC ATC GCC CCC CAA GGC TAC TCA GCC 
Asp Leu Gly Trp Leu Asp Trp Val Ile Ala Pro Gln Gly Tyr Ser Ala 
55 60 65 

TAT TAC TGT GAG GGG GAG TGC TCC TTC CCG CTG GAC TCC TGC ATG AAC 
Tyr Tyr Cys Glu Gly Glu Cys Ser Phe Pro Leu Asp Ser Cys Met Asn 
70 75 80 

GCC ACC AAC CAC GCC ATC CTG CAG TCC CTG GTG CAC CTG ATG AAG CCA 
Ala Thr Asn His Ala Ile Leu Gln Ser Leu Val His Leu Met Lys Pro 
85 90 95 

AAC GCA GTC CCC AAG GCG TGC TGT GCA CCC ACC AAG CTG AGC GCC ACC 
Asn Ala Val Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr 
100 105 110 115 

TCT GTG CTC TAC TAT GAC AGC AGC AAC AAC GTC ATC CTG CGC AAG CAC 817 
Ser Val Leu Tyr Tyr Asp Ser Ser Asn Asn Val Ile Leu Arg Lys His 
120 125 130 

CGC AAC ATG GTG GTC AAG GCC TGC GGC TGC CAC TGAGTCAGCC CGCCCAGCCC 870 
Arg Asn Met Val Val Lys Ala Cys Gly Cys His 
135 140 

TACTGCAGCC ACCCTTCTCA TCTGGATCGG GCCCTGCAGA GGCAGAAAAC CCTTAAATGC 930 

TGTCACAGCT CAAGCAGGAG TGTCAGGGGC CCTCACTCTC GGTGCCTACT TCCTGTCAGG 990 

CTTCTGGGAA TTC 1003 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 281 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12: 

Glu Pro His Trp Lys Glu Phe Arg Phe Asp Leu Thr Gln Ile Pro Ala 
-139 -135 -130 -125 

Gly Glu Ala Val Thr Ala Ala Glu Phe Arg Ile Tyr Lys Val Pro Ser 
-120 -115 -110 

Ile His Leu Leu Asn Arg Thr Leu His Val Ser Met Phe Gln Val Val 
-105 -100 -95 

Gln Glu Gln Ser Asn Arg Glu Ser Asp Leu Phe Phe Leu Asp Leu Gln 
-90 -85 -80 

Thr Leu Arg Ala Gly Asp Glu Gly Trp Leu Val Leu Asp Val Thr Ala 
-75 -70 -65 -60 

Ala Ser Asp Cys Trp Leu Leu Lys Arg His Lys Asp Leu Gly Leu Arg 
-55 -50 -45 

Leu Tyr Val Glu Thr Glu Asp Gly His Ser Val Asp Pro Gly Leu Ala 
-40 -35 -30 

Gly Leu Leu Gly Gln Arg Ala Pro Arg Ser Gln Gln Pro Phe Val Val 
-25 -20 -15 

Thr Phe Phe Arg Ala Ser Pro Ser Pro Ile Arg Thr Pro Arg Ala Val 
-10 -5 1 5 

Arg Pro Leu Arg Arg Arg Gln Pro Lys Lys Ser Asn Glu Leu Pro Gln 
10 15 20 

Ala Asn Arg Leu Pro Gly Ile Phe Asp Asp Val His Gly Ser His Gly 
25 30 35 

Arg Gln Val Cys Arg Arg His Glu Leu Tyr Val Ser Phe Gln Asp Leu 
40 45 50 

Gly Trp Leu Asp Trp Val Ile Ala Pro Gln Gly Tyr Ser Ala Tyr Tyr 
55 60 65 

Cys Glu Gly Glu Cys Ser Phe Pro Leu Asp Ser Cys Met Asn Ala Thr 
70 75 80 85 

Asn His Ala Ile Leu Gln Ser Leu Val His Leu Met Lys Pro Asn Ala 
90 95 100 

Val Pro Lys Ala Cys Cys Ala Pro Thr Lys Leu Ser Ala Thr Ser Val 
105 110 115 

Leu Tyr Tyr Asp Ser Ser Asn Asn Val Ile Leu Arg Lys His Arg Asn 
120 125 130 

Met Val Val Lys Ala Cys Gly Cys His 
135 140 



(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3623 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(vii) IMMEDIATE SOURCE: 

(B) CLONE: pALBP2-781 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 2724..3071 

(ix) FEATURE: 

(A) NAME/KEY: terminator 

(B) LOCATION: 3150..3218 

(ix) FEATURE: 

(A) NAME/KEY: RBS 

(B) LOCATION: 2222..2723 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13: 

GACGAAAGGG CCTCGTGATA CGCCTATTTT TATAGGTTAA TGTCATGATA ATAATGGTTT 60 

CTTAGACGTC AGGTGGCACT TTTCGGGGAA ATGTGCGCGG AACCCCTATT TGTTTATTTT 120 

TCTAAATACA TTCAAATATG TATCCGCTCA TGAGACAATA ACCCTGATAA ATGCTTCAAT 180 

AATATTGAAA AAGGAAGAGT ATGAGTATTC AACATTTCCG TGTCGCCCTT ATTCCCTTTT 240 

TTGCGGCATT TTGCCTTCCT GTTTTTGCTC ACCCAGAAAC GCTGGTGAAA GTAAAAGATG 300 

CTGAAGATCA GTTGGGTGCA CGAGTGGGTT ACATCGAACT GGATCTCAAC AGCGGTAAGA 360 

TCCTTGAGAG TTTTCGCCCC GAAGAACGTT TTCCAATGAT GAGCACTTTT AAAGTTCTGC 420 

TATGTGGCGC GGTATTATCC CGTATTGACG CCGGGCAAGA GCAACTCGGT CGCCGCATAC 480 

ACTATTCTCA GAATGACTTG GTTGAGTACT CACCAGTCAC AGAAAAGCAT CTTACGGATG 540 

GCATGACAGT AAGAGAATTA TGCAGTGCTG CCATAACCAT GAGTGATAAC ACTGCGGCCA 600 

ACTTACTTCT GACAACGATC GGAGGACCGA AGGAGCTAAC CGCTTTTTTG CACAACATGG 660 

GGGATCATGT AACTCGCCTT GATCGTTGGG AACCGGAGCT GAATGAAGCC ATACCAAACG 720 

ACGAGCGTGA CACCACGATG CCTGTAGCAA TGGCAACAAC GTTGCGCAAA CTATTAACTG 780 

GCGAACTACT TACTCTAGCT TCCCGGCAAC AATTAATAGA CTGGATGGAG GCGGATAAAG 840 

TTGCAGGACC ACTTCTGCGC TCGGCCCTTC CGGCTGGCTG GTTTATTGCT GATAAATCTG 900 

GAGCCGGTGA GCGTGGGTCT CGCGGTATCA TTGCAGCACT GGGGCCAGAT GGTAAGCCCT 960 

CCCGTATCGT AGTTATCTAC ACGACGGGGA GTCAGGCAAC TATGGATGAA CGAAATAGAC 1020 

AGATCGCTGA GATAGGTGCC TCACTGATTA AGCATTGGTA ACTGTCAGAC CAAGTTTACT 1080 

CATATATACT TTAGATTGAT TTAAAACTTC ATTTTTAATT TAAAAGGATC TAGGTGAAGA 1140 

TCCTTTTTGA TAATCTCATG ACCAAAATCC CTTAACGTGA GTTTTCGTTC CACTGAGCGT 1200 

CAGACCCCGT AGAAAAGATC AAAGGATCTT CTTGAGATCC TTTTTTTCTG CGCGTAATCT 1260 

GCTGCTTGCA AACAAAAAAA CCACCGCTAC CAGCGGTGGT TTGTTTGCCG GATCAAGAGC 1320 

TACCAACTCT TTTTCCGAAG GTAACTGGCT TCAGCAGAGC GCAGATACCA AATACTGTCC 1380 

TTCTAGTGTA GCCGTAGTTA GGCCACCACT TCAAGAACTC TGTAGCACCG CCTACATACC 1440 

TCGCTCTGCT AATCCTGTTA CCAGTGGCTG CTGCCAGTGG CGATAAGTCG TGTCTTACCG 1500 

GGTTGGACTC AAGACGATAG TTACCGGATA AGGCGCAGCG GTCGGGCTGA ACGGGGGGTT 1560 

CGTGCACACA GCCCAGCTTG GAGCGAACGA CCTACACCGA ACTGAGATAC CTACAGCGTG 1620 

AGCATTGAGA AAGCGCCACG CTTCCCGAAG GGAGAAAGGC GGACAGGTAT CCGGTAAGCG 1680 

GCAGGGTCGG AACAGGAGAG CGCACGAGGG AGCTTCCAGG GGGAAACGCC TGGTATCTTT 1740 

ATAGTCCTGT CGGGTTTCGC CACCTCTGAC TTGAGCGTCG ATTTTTGTGA TGCTCGTCAG 1800 

GGGGGCGGAG CCTATGGAAA AACGCCAGCA ACGCGGCCTT TTTACGGTTC CTGGCCTTTT 1860 

GCTGGCCTTT TGCTCACATG TTCTTTCCTG CGTTATCCCC TGATTCTGTG GATAACCGTA 1920 

TTACCGCCTT TGAGTGAGCT GATACCGCTC GCCGCAGCCG AACGACCGAG CGCAGCGAGT 1980 

CAGTGAGCGA GGAAGCGGAA GAGCGCCCAA TACGCAAACC GCCTCTCCCC GCGCGTTGGC 2040 

CGATTCATTA ATGCAGAATT GATCTCTCAC CTACCAAACA ATGCCCCCCT GCAAAAAATA 2100 

AATTCATATA AAAAACATAC AGATAACCAT CTGCGGTGAT AAATTATCTC TGGCGGTGTT 2160 



GACATAAATA CCACTGGCGG TGATACTGAG CACATCAGCA GGACGCACTG ACCACCATGA 2220 

AGGTGACGCT CTTAAAAATT AAGCCCTGAA GAAGGGCAGC ATTCAAAGCA GAAGGCTTTG 2280 

GGGTGTGTGA TACGAAACGA AGCATTGGCC GTAAGTGCGA TTCCGGATTA GCTGCCAATG 2340 

TGCCAATCGC GGGGGGTTTT CGTTCAGGAC TACAACTGCC ACACACCACC AAAGCTAACT 2400 

GACAGGAGAA TCCAGATGGA TGCACAAACA CGCCGCCGCG AACGTCGCGC AGAGAAACAG 2460 

GCTCAATGGA AAGCAGCAAA TCCCCTGTTG GTTGGGGTAA GCGCAAAACC AGTTCCGAAA 2520 

GATTTTTTTA ACTATAAACG CTGATGGAAG CGTTTATGCG GAAGAGGTAA AGCCCTTCCC 2580 

GAGTAACAAA AAAACAACAG CATAAATAAC CCCGCTCTTA CACATTCCAG CCCTGAAAAA 2640 

GGGCATCAAA TTAAACCACA CCTATGGTGT ATGCATTTAT TTGCATACAT TCAATCAATT 2700 

GTTATCTAAG GAAATACTTA CAT ATG CAA GCT AAA CAT AAA CAA CGT AAA 2750 
Met Gln Ala Lys His Lys Gln Arg Lys 
1 5 

CGT CTG AAA TCT AGC TGT AAG AGA CAC CCT TTG TAC GTG GAC TTC AGT 2798 
Arg Leu Lys Ser Ser Cys Lys Arg His Pro Leu Tyr Val Asp Phe Ser 
10 15 20 25 

GAC GTG GGG TGG AAT GAC TGG ATT GTG GCT CCC CCG GGG TAT CAC GCC 2846 
Asp Val Gly Trp Asn Asp Trp Ile Val Ala Pro Pro Gly Tyr His Ala 
30 35 40 

TTT TAC TGC CAC GGA GAA TGC CCT TTT CCT CTG GCT GAT CAT CTG AAC 2894 
Phe Tyr Cys His Gly Glu Cys Pro Phe Pro Leu Ala Asp His Leu Asn 
45 50 55 

TCC ACT AAT CAT GCC ATT GTT CAG ACG TTG GTC AAC TCT GTT AAC TCT 2942 
Ser Thr Asn His Ala Ile Val Gln Thr Leu Val Asn Ser Val Asn Ser 
60 65 70 

AAG ATT CCT AAG GCA TGC TGT GTC CCG ACA GAA CTC AGT GCT ATC TCG 2990 
Lys Ile Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala Ile Ser 
75 80 85 

ATG CTG TAC CTT GAC GAG AAT GAA AAG GTT GTA TTA AAG AAC TAT CAG 3038 
Met Leu Tyr Leu Asp Glu Asn Glu Lys Val Val Leu Lys Asn Tyr Gln 
90 95 100 105 

GAC ATG GTT GTG GAG GGT TGT GGG TGT CGC TAGTACAGCA AAATTAAATA 3088 
Asp Met Val Val Glu Gly Cys Gly Cys Arg 
110 115 

CATAAATATA TATATATATA TATATTTTAG AAAAAAGAAA AAAATCTAGA GTCGACCTGC 3148 

AGTAATCGTA CAGGGTAGTA CAAATAAAAA AGGCACGTCA GATGACGTGC CTTTTTTCTT 3208 

GTGAGCAGTA AGCTTGGCAC TGGCCGTCGT TTTACAACGT CGTGACTGGG AAAACCCTGG 3268 

CGTTACCCAA CTTAATCGCC TTGCAGCACA TCCCCCTTTC GCCAGCTGGC GTAATAGCGA 3328 

AGAGGCCCGC ACCGATCGCC CTTCCCAACA GTTGCGCAGC CTGAATGGCG AATGGCGCCT 3388 

GATGCGGTAT TTTCTCCTTA CGCATCTGTG CGGTATTTCA CACCGCATAT ATGGTGCACT 3448 

CTCAGTACAA TCTGCTCTGA TGCCGCATAG TTAAGCCAGC CCCGACACCC GCCAACACCC 3508 

GCTGACGCGC CCTGACGGGC TTGTCTGCTC CCGGCATCCG CTTACAGACA AGCTGTGACC 3568 

GTCTCCGGGA GCTGCATGTG TCAGAGGTTT TCACCGTCAT CACCGAAACG CGCGA 3623 

(2) INFORMATION FOR SEQ ID NO: 14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 115 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

Met Gln Ala Lys His Lys Gln Arg Lys Arg Leu Lys Ser Ser Cys Lys 
1 5 10 15 

Arg His Pro Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp Trp 
20 25 30 

Ile Val Ala Pro Pro Gly Tyr His Ala Phe Tyr Cys His Gly Glu Cys 
35 40 45 

Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala Ile Val 
50 55 60 

Gln Thr Leu Val Asn Ser Val Asn Ser Lys Ile Pro Lys Ala Cys Cys 
65 70 75 80 

Val Pro Thr Glu Leu Ser Ala Ile Ser Met Leu Tyr Leu Asp Glu Asn 
85 90 95 

Glu Lys Val Val Leu Lys Asn Tyr Gln Asp Met Val Val Glu Gly Cys 
100 105 110 

Gly Cys Arg 
115 

(2) INFORMATION FOR SEQ ID NO: 15: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15: 
CATGGGCAGC TGAG 14 



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 41 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16: 
GAGGGTTGTG GGTGTCGCTA GTGAGTCGAC TACAGCAAAT T 41 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 38 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 
GGATGTGGGT GCCGCTGACT CTAGAGTCGA CGGAATTC 38 

(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 
AATTCACCAT GATTCCTGGT AACCGAATGC T 31 

(2) INFORMATION FOR SEQ ID NO: 19: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19: 
GTGGTACTAA GGACCATTGG CTTAC 
(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20: 
CGACCTGCAG CCATGCATCT GACTGTA 
(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21: 
TGCCTGCAGT TTAATATTAG TGGCAGC 
(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 
CGACCTGCAG CCACC 

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 81 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 



(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 
TCGACCCACC ATGCCGGGGC TGGGGCGGAG GGCGCAGTGG CTGTGCTGGT GGTGGGGGCT 
GTGCTGCAGC TGCTGCGGGC C 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 73 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 
CGCAGCAGCT GCACAGCAGC CCCCACCACC AGCACAGCCA CTGCGCCCTC CGCCCCAGCC 
CCGGCATGGT GGG 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25: 
TCGACTGGTT T 

(2) INFORMATION FOR SEQ ID NO: 26: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 



CGAAACCAG 



(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27: 
TCGACAGGCT CGCCTGCA 
(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28: 
GTCCGAGCGG 

(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 29 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 
CAGGTCGACC CACCATGCAC GTGCGCTCA 
(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 27 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 
TCTCTCGACC TCGGAGGAGC TAGTGGC 



